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METHOD FOR PERFORMING ERROR CORRECTIONS OF DIGITAL 
INFORMATION CODIFIED AS A SYMBOL SEQUENCE 

5 Priority Claim 

[1] This application claims priority from European patent application 
No. 03425172.8, filed March 19, 2003, which is incorporated herein by reference. 

Technical Field 

10 [2] In its more general aspect, the present invention relates to a method for 
applying the self-corrector code theory to digital information coded as symbol 
sequences, for example in the Boolean logic, stored in electronic memory systems 
or transmitted from and to these systems. 

[3] More particularly, the invention relates to a method as above providing the 
15 transmission of sequences incorporating a portion of error corrector code allowing 
the sequence, which is more probably the original transmitted through the 
calculation of an error syndrome by using a parity matrix, to be restored when 
received. 

Background 

20 [4] In the specific technical field of communication systems, it is well known that 
any message comprising digital information can be processed and transferred from 
a system to another through electronic communication means which might be 
affected by noise. 

[5] In substance, a sequence x of Boolean symbols transmitted through a 
25 communication channel undergoing noise can be received as a different sequence 

1 



03-AG-077/AI; SCH069BUS/BF/jf; 2110-109-3 



y from which it is necessary to go back to the initial sequence x. 

[6] Traditionally, the sequence x of symbols to be transmitted comprises an 
additional or redundant portion including an error corrector code allowing the 
message, which is more probably the original even with errors, to be restored when 
5 received. 

[7] These error corrector codes are based on well known mathematical theories, 
such as for example the Hamming code theory, which are presently applied in 
several contexts wherein it is necessary to remedy noise in communication 
channels. 

10 [8] For a better understanding of all aspects of the present invention, a detailed 
description of the most used methods for correcting errors in digital information 
coded as symbol sequences in the Boolean logic is illustrated hereinafter. 

0.1 Basic definitions 

15 

[9] Definition 1 Given nrvn real numbers, a table like the following one is called 
matrix of the type [m x n]: 

^ an an • - - ^ 

d>2! «22 " l U'2» 

M = 

[10] Definition 2 The transpose of the above matrix, indicated with M T , is the 
matrix: 

^ an <*2i - ' ' a™ i ^ 

#12 rt 22 * * ' ftm2 
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obtained from M by exchanging, in order, rows with columns. 

[11] Definition 3 A n-x-n-order square matrix M is considered. Fixing an element 
a ik of the matrix M and eliminating therein the row and the column crossing in the 
element (the i-th row and the k-th column) a square matrix of order (n-1) x (n-1) is 
5 obtained, whose determinant is called complementary minor of aj k and will be 
indicated with M ik . 

[12] Definition 4 The determinant of the second order matrix is the number: 

an822 - 312321 

[13] Definition 5 The determinant of a n-order matrix is: 

±a ik -{-\T k M ik 
10 *-» 

[14] Definition 6 The square matrix having 1 as elements aii and 0 elsewhere is 
called identity matrix and is indicated with I. 

[15] Definition 7 A group G is a set in which an operation * is defined, for which 
G is closed for *, i.e. if g e G and heG=>g*heG; 
15 * is associative; 

G has the identity, i.e. 3 and e G so that e*g = g*e = gVg <= G; 
Vg g G the inverse exists, i.e. 3g- 1 e G so that g- 1 * g = g * g- 1 = e. 

[16] Definition 8 If the operation * is the sum the group is called additive 

[17] Definition 9 A group is called abelian if the operation * is commutative 

20 [18] Definition 10 The set <i 0,1,2, p-1 \ is called remainder class (mod p) and 
is indicated with Z p , the property being that in these classes p = identity. 

[19] Definition 1 1 A Boolean group is a binary group, i.e. a group containing only 
the numbers 0 and 1 and 1 + 1=0. 

[20] Definition 12 A set of vectors v if v k is linearly dependent if and only if 
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there are some scalars c1, c k * 0 so that ciVi+c 2 V2+...+CkV k =0. 

[21] Definition 1 3 A family of vectors is called base of the area if it is a generating 
family, i.e. any other vector of the area is a linear combination of these vectors, and 
it is composed of linearly independent vectors. 

5 

0.1.1 Codes 

[22] The aim of the self-corrector code theory, a branch of the information theory, 
was originally born to solve some practical problems in the communication of coded 

10 digital information. A message is considered as a block of symbols of a finite 
alphabet; it is usually a sequence of 0 and 1 but it can be also any number, a letter 
or a complete sentence. The message is transmitted through a communication 
channel undergoing a noise. The aim of the self-corrector code theory is to add 
redundant terms to the message so that it is possible to go back to the original 

15 message if the transmitted message has been damaged. First of all, a difference 
must be made between diagnosing and correcting errors. Diagnostics detects the 
presence of an error, while the correction detects and corrects the error. 

[23] Each message called c consists of k information digits. The coding turns, 
according to certain rules, each input message c into a binary nth number x with n 
20 > k. 

[24] This binary nth number x is the code word of the message c. During the 
transmission some errors can occur, the binary nth number y being thus received 

c — > x -> channel — > y 

[25] The area V of all nth numbers of 0 and 1 will be now considered adding 
25 component vectors per module component 2. 

[26] Definition 14 A linear binary code [n,k] is the set of all linear combinations of 
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k(*0) independent vectors in V. Linear means that if two or more vectors are in the 
code, also their sum is therein. 

[27] Definition 15 A generating matrix G for a linear code is a matrix k x n whose 
rows are a base for C. 

5 [28] Definition 16 A parity matrix H of a linear code is a matrix n x k so that G 
H=0. 

[29] Definition 1 7 H is the parity matrix of a code C w e C if and only if wH T =0. 

[30] Definition 18 G is called in standard form if G = (l k P) where l k is the identity 
matrix k x k and P is a matrix k x (n-k). If G is in the systematic or standard form, 
10 then the first k symbols of a word are called information symbols. 

[31] Theorem 19 If a code C [n,k] has a matrix G = (l k P) in the standard form, 
then a C parity matrix is H = (-P T l n -k) where P T is the transpose of P and is a matrix 
(n-k) x k and l n -k is the identity matrix (n-k) x (n-k) 

[32] Systematic codes have the advantage that the data message is in the code 
15 word and it can be read before decoding. For codes in the non-systematic form the 
message is no more recognizable in the coded sequence and an inverter is needed 
to recognize the data sequence. 

[33] Definition 20 Being C a linear code with parity matrix H, then, given x a 
binary nth number xH T , is called syndrome of x. 

20 [34] Definition 21 The weight of a vector u is the number of component being 
different from 0. 

[35] Definition 22 The code minimum weight d is the weight of the vector different 
from 0 having the lowest weight in the code. 

[36] d is thus a measure of the "quality" of a code. 
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[37] Defined a sphere S r (u) with radius r around a vector u like 
S r (u) = {yeV | 6(i±v) < r} 

[38] Theorem 23 If d is the minimum weight of a code C, then C can correct at 
~ d-\ 

most t = L 2 J errors and vice versa. 

5 [39] Corollary 24 C has a minimum weight d if d is the highest number so that 
each d-1 columns of the parity matrix H are independent. 

[40] Supposing for example that a code in the systematic form correcting 2 errors 
is to be produced. The matrix H will be composed of the identity matrix and of a 
matrix P T having 4 linearly independent columns, i.e. so that the determinant of the 
10 sub-matrix composed of these four columns * 0. Therefore, according to the 
number of errors to be corrected, a matrix H with d-1 linearly independent columns 
is searched. Therefore, given n and k, a code with d being the widest possible is 
searched in order to correct more errors. 

[41] It is however possible to have vectors in V which are not comprised in any of 
1 5 these spheres. 

[42] Definition 25 A minimum-weight-d code C is called perfect if all vectors in V 

~ d-\ 

are comprised in spheres of radius t = L 2 J around the code words. In this case 
it can be said that the spheres cover the area. 

For the given n and k they are the best codes. 

20 [43] Theorem 26 For a perfect binary code [n,k] to exist, n, k and t must satisfy 
the following equation 
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Generally, 

[44] Theorem 27 For a code [n,k] to exist, n, k and t must satisfy the following 
inequality known as Hamming inequality: 



[45] When the word y is received the word x being sent and afterwards the data 
5 message c are to be searched. With the following formula: y = x+£ t => H(m+£ t ) = H£ t 
where £t is a particular error class. If H^ t e H, then it can be said which is the wrong 
position. 

[46] Supposing that an error occurs: 
m + £j=> H(m+^i) = H^i 
10 H^j e H? ^ wrong position: i 

[47] Supposing now that two errors occur: 
m + £ + % => H(m+^+ §,) = H^i + H^j = s 
V^j^H^i + H^j e H? — > wrong positions: i and j 

[48] The following practical example for corrector codes of one error (Hamming 
15 codes) is now examined: the Hamming code [7,4] described by the following 
generating matrix is considered: 




/ l o o o o l i \ 



6 T = 



0 10 0 10 1 



0 0 10 110 



\ 0 0 0 1 1 1 1 / 
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[49] The first 4 positions are considered as the information positions and the last 
3 positions as redundancy positions. Therefore the first row is the message 10 0 0 
and so on. All words are obtained by adding (mod 2) those rows. For example the 
message u = (1 01 1 ) is coded as x = 



5 (1 01 1010). The parity matrix H is considered: 

[50] It must be noted that the matrix columns have been written so that the i-th 
column is composed of 2-based i-development coefficients, in case completed by 
0. 

[51] Supposing to send the message x above and that an error occurs. The 
10 message y = (1010010) is thus received. The syndrome is calculated: 



(1 0 0) is the binary representation of 4; the wrong bit is therefore the fourth. 

[52] The ideal is thus to search perfect codes, but they are not always found, 
moreover codes recognizing an error of the 0 ^1 type from 1^0 are wished. 

15 [53] Although advantageous under many aspects, the methods presently used 
require adding a redundancy information portion which, the size of the single 
message to be coded being fixed, cannot be lower than a minimum indicated. A 
technical problem underlying embodiments of the present invention is to provide a 
linear code protecting digital information coded like binary symbol sequencesand 

20 overcoming the limits of the solutions presently provided by the prior art. 




Hy T = (100) 
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Summary 

[54] According to one aspect of the invention, a coding is identified for a binary 
alphabet in non Boolean groups, i.e. in non binary groups. 

[55] On the basis of this solution idea the technical problem is solved by methods 
5 according to other spects of the invention as previously indicated and as defined in 
claim 1 and the following. 

[56] The features and advantages of methods according to aspects of the 
invention will be apparent from the following description of embodiments thereof 
given by way of non-limiting example. 

10 

Detailed Description 

[57] A method according to an embodiment of the invention is now described in 
detail, which applies the self-corrector code theory to digital information coded as 
symbol sequences. 

15 [58] More particularly, a method according to one embodiment of the invention 
allows error corrections to be performed on digital information coded as symbol 
sequences x, for example digital information stored in electronic memory systems 
or transmitted from and to these systems and providing the transmission of 
sequences x incorporating an error corrector code portion allowing the sequence x, 

20 which is more probably the original transmitted through the calculation of an error 
syndrome using a parity matrix, to be restored when received. 

[59] Advantageously, the method provides that the error code incorporated in the 
original sequence x belongs to a non Boolean group. 

[60] The error code used is a linear code, as it will be apparent from the following 
25 detailed description of the method embodiments. 

9 
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[61] 0.2 Codes on different groups 

[62] Additive groups are considered. The group of operation with the previous 
codes is Boolean, i.e. being x a field element it results that x + x = identity with 
respect to the sum. Now additive groups are considered (mod p) with p e N. 

5 [63] Similar codes to the above-described codes are searched, i.e. codes for 
which, being H the code parity matrix and y the received word it results: 



if y is a code word. Linear codes are thus searched. Moreover if y is affected by 
one or more errors, it results: 



where Sj and Sj are the i-th and j-th columns of the matrix H T . The code being 
searched must therefore belong to an Abelian group to have this property. 

[64] Codes in a systematic form are searched and the method for forming the 
identity matrix is analyzed. Columns are considered as 1 0-base-written numbers. 

15 The matrix will then become a number vector and the product matrix by message 
received will become a scalar product. Operating in a group (mod p) the numbers 
composing the identity matrix must be such that the matrix composed of their 
binary representation has a determinant * 0. The parity bit number n - k being 
fixed, p is chosen so that: 

20 2 n " k + 1 < p < 2 n " k+1 - 1 

[65] The identity matrix is composed of the numbers p-1, p-2 p-2 n "\ A code C 

[7,4] with p = 8 is considered, the identity matrix will be composed of the numbers 
7, 6 and 4. The binary-written matrix will then have the form: 
opposite to the usual identity — / x 



y H T = 0 



10 
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represented by the 1 0-based numbers: 1 , 2 and 4 

[66] It must be noted that any matrix could be chosen, having a "determinant" * 
0, i.e. a number belonging to that matrix is not a linear combination of other 
numbers belonging to that matrix. This choice is particularly effective. It can be 
seen with an example. 

[67] Supposing that the product of a data vector by a certain matrix P (H = (P,l)) 
has given the result 1, which, binary-written as 100, will compose the code part to 
be added to the word, m is seen as a weight vector ci; thus being xi the numbers 
composing the matrix H (seen as a vector): 

n 

mH = Y J c i x i 

'=* ci = 0,1 

[68] Where the sum is done (mod p). When the message is received, the 
multiplication m ■ H T must occur, i.e. ( mk , m n -k) • (P,l) = m k • P + m n -k • I- In this case 
the first value is 1 and so that the message is correct it must be: 

1 + m n ' k ■ I = 0 (mod p) 

[69] The usual matrix i.e. [1 ,2,4] is chosen as identity matrix. It results: 

[1,2,4] (ci,c 2 , c 3 ) + 1 = 0 

[70] Working in a field Zs, instead of having 0 as second member, 8k can be 
obtained with keN. The solution is (ci, C2, C3) = (111). 

[71] The suggested matrix, i.e. [7,6,4], is now chosen as the matrix. It results: 

[7,6,4] (ci,c 2 , c 3 ) + 1 = 0 

[72] The solution is (ci, c 2 , C3) = (100), i.e. the same value as the calculated code. 
This fact is not random, with the identity matrix suggested the calculated code is 
always equal to the code received if errors have not occurred. 

[73] The numbers composing the parity matrix P columns must be chosen 
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according to similar criteria to those of the Boolean group. 

[74] With codes in these groups the error 1 -> 0 is distinguished from 0 1 , thus 
the channel is no more symmetrical. In fact: 

if the syndrome returns a value x with x e H the error occurred is 0 -> 1 ; 

5 if the syndrome returns a value x with x e H, but p - x e H, then the error occurred 
is 1 -> 0; 

[75] An error +1 is allocated to the first case and an error -1 to the second case. 

[76] A code [6,1] with p = 22 is considered. 

H = (11|21 20 18 14 6) 

10 In binary this matrix will be: 



/ 


1 


1 


0 


0 


0 


0 






1 


0 


0 


1 


1 


1 






0 


1 


1 


0 


1 


1 






1 


0 


0 


0 


1 


0 




\ 


0 


1 


1 


1 


0 


0 


) 



[77] The code words will then be: 

0|00000 
1|11010 

[78] The second code word is sent, but 111110 is received, i.e. an error +1 has 
15 occurred in the fourth position. Calculating: (111110) • H = 1 • 11 + 1 -21 +1 • 20 + 
1-18+1-14 = 84 which, in the group considered, is 18. 18 is in the matrix H and 
thus the error occurred is 0 -> 1, moreover 18 is in the fourth position of the matrix, 
which is the wrong message position. 

[79] Supposing now that 101010 is received, i.e. an error -1 has occurred in the 
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second position. It must be calculated: (101010) ■ H = 1 -11 +1 -20 + 1 • 14 = 45 
which, in the group considered, is 1. 1 is not in the matrix H, but 22-1 = is therein 
and therefore the error occurred is 1 -> 0, moreover 21 is in the matrix second 
position which is the wrong position in the message. 

5 [80] It must be observed that the errors in the message received can be only of 
one type, or +1 or -1 in each position, if the corresponding bit is 0 or 1 in the 
message received. If an impossible error is detected, it means that the code could 
diagnose but not correct the errors. 

[81] A contradictory example is now described. 

10 [82] A code [3,1] in a group (mod 4) is considered, in which the matrix H = (1|32). 
The code words will be: 

0|00 1|10 

[83] The message 000 is sent and 010 is received. 

(010) ■ H = 3 

15 3 is in the matrix and this would indicate an error +1 in the second position. 4 - 3 = 
1 is also in the matrix and this would indicate an error -1 in the first position. In fact 
010 can be obtained also from 110 with an error in the first position. Therefore a 
code cannot be found on Z 4 . Sometimes, in order to correct the errors, it is 
necessary not only to calculate the syndrome but also to compare the bits received. 

20 A code [3,1] is considered on Z 5 with matrix H = (3|43). The code words will be: 

0|00 1|11 

[84] The word 000 is sent, all errors which may occur and the decoding are 
considered. 

001 =^> syndrome = 3. Possible errors: 
25 1) +1 in the first position; 

2) +1 in the third position; 

13 
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[85] Given that a 0 is received in the first position, the case 1 is not possible. 

010 => syndrome = 4. Possible error: +1 in the second position. 

100 => syndrome = 3. Possible errors: 

1 ) +1 in the first position; 
5 2) +1 in the third position; 

[86] Given that a 0 is received in the third position, the case 2 is not possible. 
The word 111 is now sent, all errors which may occur and the decoding are 
considered. 

01 1 => syndrome = 2. Possible errors: 
10 1 ) -1 in the first position; 

2) -1 in the third position; 

[87] Given that a 1 is received in the third position, the case 2 is not possible. 

101 => syndrome = 1 . Possible error: -1 in the second position. 
1 10 => syndrome = 2. Possible errors: 

15 1 ) -1 in the first position; 

2) -1 in the third position; 

[88] Given that a 1 is received in the first position, the case 1 is not possible. 

[89] Therefore the type of error occurred is distinguished by comparing the 
syndrome with the values actually received. The manufacture of a circuit describing 
20 this method involves the creation of non-binary adders, even if they operate with a 
frequency of 0 and 1 (writing each number with the binary representation). If for 
example operation is made on Z5, the adder must be able to say that (100) + (100) 
= (010), i.e. 1 * 1 = 2, but (110) + (010) = (000), i.e. 3*2 = 5. Moreover it must be 
possible to find the complement of a number which will be searched in the matrix. 

25 [90] The error correcting code methodology described herein may be utilized in a 
variety of different types of electronic systems, such as communications, digital 
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video, memory and computer systems, as will be appreciated by those skilled in the 
art. 

[91] From the foregoing it will be appreciated that, although specific embodiments 
of the invention have been described herein for purposes of illustration, various 
5 modifications may be made without deviating from the spirit and scope of the 
invention. 
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